Picture for Yifan Li

Yifan Li

$τ_0$-WM: A Unified Video-Action World Model for Robotic Manipulation

Add code
May 31, 2026
Viaarxiv icon

MBench: A Comprehensive Benchmark on Memory Capability for Video World Models

Add code
May 30, 2026
Viaarxiv icon

Spatially Prompted Visual Trajectory Prediction for Egocentric Manipulation

Add code
May 19, 2026
Viaarxiv icon

Adaptive Context Matters: Towards Provable Multi-Modality Guidance for Super-Resolution

Add code
May 11, 2026
Viaarxiv icon

Meta-LegNet: A Transferable and Interpretable Framework for Surface Adsorption Prediction via Self-Defined Adsorption-Environment Learning

Add code
May 03, 2026
Viaarxiv icon

PepSpecBench: A Unified Evaluation Benchmark for Peptide Tandem Mass Spectrometry Prediction

Add code
May 03, 2026
Viaarxiv icon

Improving Vision-language Models with Perception-centric Process Reward Models

Add code
Apr 27, 2026
Viaarxiv icon

VibeFlow: Versatile Video Chroma-Lux Editing through Self-Supervised Learning

Add code
Apr 15, 2026
Viaarxiv icon

Characterizing Lidar Range-Measurement Ambiguity due to Multiple Returns

Add code
Apr 10, 2026
Viaarxiv icon

Doctor-RAG: Failure-Aware Repair for Agentic Retrieval-Augmented Generation

Add code
Apr 01, 2026
Viaarxiv icon